Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task
نویسندگان
چکیده
VQA is an ambitious task aiming to answer any image-related question. However, in reality, it hard build such a system once for all since the needs of users are continuously updated, and has implement new functions. Thus, Continual Learning (CL) ability must developing advanced systems. Recently, pioneer work split dataset into disjoint sets study this topic. CL on involves not only expansion label (new Answer sets). It crucial how questions when deploying systems environments Visual scenes) requiring functions Question types). we propose CLOVE, benchmark On quEstion answering, which contains scene- function-incremental settings two aforementioned scenarios. In terms methodology, main difference between classification that former additionally expanding preventing forgetting reasoning mechanisms, while latter focusing class representation. real-data-free replay-based method tailored VQA, named Scene Graph as Prompt Symbolic Replay. Using piece scene graph prompt, replays pseudo graphs represent past images, along with correlated QA pairs. A unified model also proposed utilize current replayed data enhance its ability. Finally, experimental results reveal challenges CLOVE demonstrate effectiveness our method. Code available at https://github.com/showlab/CLVQA.
منابع مشابه
Continual Learning with Deep Generative Replay
Attempts to train a comprehensive artificial intelligence capable of solving multiple tasks have been impeded by a chronic problem called catastrophic forgetting. Although simply replaying all previous data alleviates the problem, it requires large memory and even worse, often infeasible in real world applications where the access to past data is limited. Inspired by the generative nature of th...
متن کاملthe effect of teaching reading as whole text task and paragraph task on efl learners speaking
abstract the present study was conducted to investigate the effectiveness of a rather newly developed method in language teaching that is the use of paragraph reading task and whole reading task and their effects on improving efl learner’s speaking ability. to fulfill the purpose of the study, first 90 participants studying their course at sa eedi high school in tehran were chosen by means of ...
15 صفحه اولGraph Structure Learning for Task Ordering
In many practical applications, multiple interrelated tasks must be accomplished in sequential order through user interactions with multiple retrieval, classification and recommendation systems. The ordering of the tasks may have a significant impact on the overall utility (or performance); hence optimal ordering of tasks is desirable. However, manual specification of near-optimal ordering is o...
متن کاملReplay Scene Based Sports Video Abstraction
Video abstraction can be useful in multimedia database indexing and querying and can illustrate the important content of a longer video to quick browsing. Further, in sports video, replay scene often demonstrates the highlight of the video. The detection of replay scene in the sports video is a key clue to sports video summarizing. In this paper, we present a framework of replay scene based vid...
متن کاملScene Graph Parsing as Dependency Parsing
In this paper, we study the problem of parsing structured knowledge graphs from textual descriptions. In particular, we consider the scene graph representation (Johnson et al., 2015) that considers objects together with their attributes and relations: this representation has been proved useful across a variety of vision and language applications. We begin by introducing an alternative but equiv...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i1.25208